The ISLE Corpus: Italian and German Spoken Learners English
نویسندگان
چکیده
Background: ISLE project aims Project ISLE (Interactive Spoken Language Education) aimed to exploit available speech recognition technology to improve the performance of computerbased English language learning systems, specifically for adult German and Italian learners of English. The English language teaching industry is showing increasing interest in and awareness of the relevance and potential of speech and language technology (Atwell 1999). The project conducted a detailed survey and analysis of prospective user requirements (Atwell et al. 2000): we sought expert advice and opinions from a range of prospective end-users (learners of English as a second language), as well as meta-level experts or professionals and practitioners in English language teaching (ELT teachers and researchers) and industry experts in the ELT market (publishers of ELT resources, textbooks and multimedia). The ISLE project partners included representative users, English language learners at all six sites in the ISLE project consortium: Dida*el S.r.l. (Milan, Italy), Entropic Cambridge Research Laboratory Ltd. (Cambridge, UK), Ernst Klett Verlag (Stuttgart, Germany), University of Hamburg (Germany), University of Leeds (UK), University of Milan Bicocca (Italy). Leeds University is a centre for English language teaching and research; Leeds University, Hamburg University and Entropic Cambridge had ready access to overseas students from Germany and Italy; Klett is a major German publisher of ELT resources and textbooks; and Dida*el is a major Italian publisher of multimedia educational systems. We developed a demonstrator English pronunciation tutor system, including an error diagnosis module to pinpoint and flag mispronounced words in a learners spoken input (Herron et al. 1999).
منابع مشابه
The ISLE Corpus of Non-Native Spoken English
For the purpose of developing pronunciation training tools for second language learning a corpus of non-native speech data has been collected, which consists of almost 18 hours of annotated speech signals spoken by Italian and German learners of English. The corpus is based on 250 utterances selected from typical second language learning exercises. It has been annotated at the word and the phon...
متن کاملA Corpus-based Analysis of Collocational Errors in the Iranian EFL Learners' Oral Production
Collocations are one of the areas generally considered problematic for EFL learners. Iranian learners of English like other EFL learners face various problems in producing oral collocations. An analysis of learners' spoken interlanguage both indicates the scope of the problem and the necessity to spend more time and energy by learners on mastering collocations. The present study specifically f...
متن کاملIs non-native pronunciation modelling necessary ?
It is difficult to recognize accented or non-native speech with speech recognition systems that are trained using native speech. While standard acoustic speaker adaptation techniques are often applied in these cases, they can only reduce the recognition errors that are due to mispronunciations on the phoneme level. They are not able to handle severe deviations from the expected pronunciation. A...
متن کاملSpoken English Learner Corpora
In this paper we present a survey of some most significant spoken English learner corpora created up to date. Spoken learner corpora which include speech generated by learners are important in many areas of research and practice, in particular, for identifying typical pronunciation errors of learners of English as a second language (ESL), English as a foreign language (EFL), or English as a lin...
متن کاملKorean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability
This paper introduces a corpus of Korean-accented English speech produced by children (the Korean Children’s Spoken English Corpus: the KC-SEC), which is constructed by Seoul National University. The KC-SEC was developed in support of research and development of CALL systems for Korean learners of English, especially for elementary school learners. It consists of read-speech produced by 96 Kore...
متن کامل